THE MOBIUS FUNCTION AND DISTAL FLOWS 



JIANYA LIU & PETER SARNAK 

Abstract. We prove that the Mobius function is Unearly disjoint from an analytic skew 
product on the 2-torus. These flows are distal and can be irregular in that their ergodic 
averages need not exist for all points. We also establish the linear disjointness of Mobius 
from various distal homogeneous flows. 



1. Introduction 

Let 2^ = (T, X) be a flow, namely X is a compact topological space and T : X — )■ X 
a continuous map. The sequence ^(n) is observed in ^ if there is an / G C{X) and an 
X G X, such that ^(n) = f{T^x). Let be the Mobius function, that is /u(n) is if 
n is not square-free, and is (—1)* if n is a product of t distinct primes. We say that /i is 
linearly disjoint from ^ if 

^J]/iHe(r2)^0, asX^oo, (LI) 

n<N 

for every observable ^ of The Mobius Disjointness Conjecture of the second author 
asserts that /i is linearly disjoint from every ^ whose entropy is [TB], fTT]. The results 
for /u(n) in this paper can be proved in the same way for similar multiplicative functions 
such as \{n) = (— where r(n) is the number of prime factors of n. This Conjecture 
has been established for many flows ^ (see [5], [H], [9], [3], [2]) however all of these flows 
are quasi-regular (or rigid) in the sense that the Bikrhoff averages 



N 

n<N 



exists for every observed in In this paper we establish some new cases of the 
Disjointness Conjecture and in particular for irregular flows , that is ones for which 
flL2p fails. These flows are complicated in terms of the behavior of their individual orbits 
but they are distal and of zero entropy, so that the disjointness is still expected to hold. 
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Our first result is concerned with certain regular flows, namely afline linear maps of a 
compact abelian group X. Such a flow (T,X) is given by 

T{x) = Ax + b (1.3) 

where A is an automorphism of X and 6 G X (see [TU], [H]). 

Theorem 1.1. Let = {T,X) be an affine linear flow on a compact abelian group which 
is of zero entropy. Then fi is linearly disjoint from 2J . 

The flows in Theorem 11.11 are distal, and our main result is concerned with nonlinear 
distal flows on such spaces. We restrict to X = the two dimensional torus M^/Z^ and 
consider nonlinear smooth (or even analytic) skew products as discussed in Furstenberg 
0. T : ^ is given by 

T{x,y^ = [ax \ a,cx \ dy \ h{x)) (1.4) 

where a, c, G Z, ad = ±1, a G M and /i is a smooth periodic function of period 1. The 
affine linear part is in the form 

" a 



d 



G GL2(Z) 



ensuring that S has zero entropy (and it can always be brought into this form). The flow 
(T, T^) is distal and this skew product is a basic building block (with e{h{x)) continuous) 
in Furstenberg's classiflcation theory of minimal distal flows [7j. If a is diophantine, that 
is 

' a c 
a - - > — 
q 

for some c> 0,m, < oo and all a/q rational, then T can be conjugated by a smooth map 
of to its affine linear part 

{x, y) ^ {ax + a,cx + dy + (3) (1.5) 

where ^ 

/3 = / h{x)dx 
Jo 

(see [IS]). Hence the disjointness of fi from ^ = (T, T^) for a T with a diophantine 
a, follows from Theorem 11.11 However if a is not diophantine the dynamics of the flow 
(T, T^) can be very different from an affine linear flow. For example, as Furstenberg shows 
it may be irregular (i.e. the limits in (11. 2p fail to exist for certain observables) . Our main 
result is a proof that these nonlinear skew products are linearly disjoint from fi, at least if 
h satisfles some further technical hypothesis. Firstly we assume that h is analytic, namely 
that if 

h{x) = h{m)e{mx) (1-6) 

mGZ 
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then 



h{m) < e-"l"^l (1.7) 

for some r > 0. Secondly we assume that there is r2 < oo such that 

|/i(m)| > e-"^H. (1.8) 

This is not a very natural condition being an artifact of our proof. However it is not too 
restrictive and the following applies rather generally (and most importantly there is no 
condition on a). 

Theorem 1.2. Let = (T.T^) be of the form ([T^j, with h satisfying ([T^) and (C^j. 

Then fi is linearly disjoint from . 

Theorem 11.11 deals with the affine linear distal flows on the n-torus. A different source 
of homogeneous distal flows are the affine linear flows on nilmanifold X = GjV where 
G is a nilpotent Lie group and F a lattice in G. For ^ = (T, G/F) where T{x) = axT 
with a E G, i.e. translation on G/T, the linear disjointness of /j, and ^ is proven in [8j 
and |9] . Using the classification of zero entropy (equivalently distal) affine linear flows on 
nilmanifolds [1], and Green and Tao's results we prove 

Theorem 1.3. Let = (T, G/F) where T is an affine linear map of the nilmanifold 
G/T of zero entropy. Then /i is linearly disjoint from ^ . 

We end the introduction with brief outline of the paper and proofs. Theorem 11.11 with a 
rate of convergence is proved in §2. We first reduce to the torus case and then handle the 
torus case by Fourier analysis and classical results of Davenport and Hua on exponential 
sums concerning the Mobius function, which is stated as Lemma [2?T] in the present paper. 
The proof of Theorem 11.21 occupies §§3-6. The assertion of Thereom 11.21 holds for all a, 
and so we have to consider all diophantine possibilities of a. The case when a is rational 
is easy and this is done in §3. When a is irrational we have to distinguish three cases (A), 
(B), and (C), and the first two cases with rates of convergence are handled in §4 and §5 
respectively via different analytic techniques. The most complicated case (C) is studied 
in §6, and the tool for this is the Bourgain-Sarnak-Ziegler finite version of the Vinogradov 
method (see Lemma [6.21) . incorporated with various analytic methods such as Poisson's 
summation and stationary phase. Thus in case (C) we offer no rate. Furstenberg [6j gives 
examples of skew product transformations of the form fll.4p which are not regular in the 
sense of fll.2p . Many of the flows ^ in Theorem 11.21 have this property and we show in 
§7 that Furstenberg's examples are smoothly conjugate to such ^'s. In particular his 
examples are linearly disjoint from /i. By analyzing the structure of affine linear maps of 
nilmanifolds, Theorem 11.31 is reduced in §8 to a recent result of Green- Tao of polynomial 
obits on nilmanifolds (see Lemma 18.11) . 
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Throughout the paper there are various double exponential functions like e{e{f{n))) 
against the Mobius function where e{x) = as usual, and so we have to keep 
track of the dependence of each parameter very carefully. 

2. Theorem O 

2.1. Reduction to the toral case. We first reduce to the case that X is a torus (not 
necessarily connected), that is X = T*" x C = W/Z^ x C for some integer r > and C 
is a finite (abelian) group. Since the linear combinations of characters & T := X , the 
(discrete) dual group of X, are dense in C(X) it suffices to show that 

^ ^ /i(n)V'(T'^a;) ^ 0, as X ^ oo (2.1) 

n<N 

for every fixed x G X and ^ T. So fix G F and let be the smallest closed 
subgroup of r containing ip and invariant by A. Here we are denoting by T the affine 
linear map Tx = Ax + of X and A acts on F by A(j){x) = (j){Ax) for G F, x G X. If 
C is a subgroup of F let C"*- the annihilator of C be the closed subgroup of X given by 
C"*- = {x G X : c(x) = 1 for all c G C}. Set X^ to be the compact quotient group X/C^. 

By definition X/C^ = C^. For x G X and y G C^, 

T(x + y) = A{x + y) + b = Ax + b + Ay = Ax + b mod 

since is A-invariant. Hence T(x + y) = Tx mod C^, that is T induces an affine linear 
map of X^. Put another way the fiow = (T^,X^) is a factor of ^ = (T, X). 
Since we are assuming that has zero entropy it follows that so does This in turn 
implies that is finitely generated, as shown by Aoki (see P page 13). Being the dual 
of X^ it follows that X^ is isomorphic to T*" x C for some r > and finite C. Moreover 
for n > 

where in the last x is the projection of x in X^ and ip is the character on X^ induced by 
ip. In particular the observable ip(T"'x) on ^ is equal to ipiT^px) on Thus (12.11) will 
follow from the linear disjointness of the Mobius function from ^ ^. This completes the 
reduction to the toral case. 

2.2. AfRne linear maps on a torus. We have reduced Theorem 11.11 to the case that 
= (T,X) with X = W'/Z'' x C with C finite and T in the form ([L3D and of zero 

entropy. For our purpose of examining observables ^(n) in this fiow, we can "linearize" 
the fiow by doubling the number of variables. That is consider y = X x X and the linear 
automorphism W given by 

iy(Xi,X2) = (Axi +X2,X2). (2.2) 
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^ = {W,Y) is clearly of zero entropy since ^ is so, and the orbit W"'{xi,b) is equal to 
{T'^Xi,b),n > 1. Hence it suffices to prove Theorem 11.11 for such ^^'s. That is we can 
assume that ^ = (W,X) with X = M^/Z"* xF, F finite and is a linear automorphism 
of X of zero entropy. Either by noting that the induced action of on X must preserve 
IxF (since these are precisely the elements of finite order in X) or using the continuity of 
W to conclude that it preserves the connected component of in X (i.e. ]R™'/Z''" x {0}), 
we see that W takes the block triangular form 

Wiej) = iBe + Cf,Df) (2.3) 

where B : M'^/Z™' — )■ ]R'"/Z™ is an automorphism of this (connected) torus, C : F ^ 
]R'"/Z™ is a homomorphism and D : F — )■ F is an automorphism of F. The automorphism 
B lifts to a linear automorphism B of M"* which preserves Z"^, so that B e GLmi"^)- Since 
W has zero entropy so does B and it is known that this implies that B is quasi-unipotent 
[1]. That is, for some z^i > 1, B"^ = U is unipotent, or f/ = I + Ni with Ni nilpotent and 
/ the identity matrix. Also since F is finite it is clear that D'^'^ = I for some z/2 > 1. Let 
1/ = lcm(z/i, z/2). Then we have that 

W''ieJ) = {iI + N^)e + C^fJ) (2.4) 

where Ci is a morphism from F to M'^/Z™. In particular 

(^■=W'' = I + N (2.5) 

where N : X X satisfies A^^+^ = for some k > 0. Thus for g > an integer 

q , . min{k,q) . . 

^^ = E(!)^*= E (!)^- (2-6) 

Writing n > as n = gz/ + Z with < / < z/ we have 
and hence if x G X and n = qu + I then 

min(fc,g) , ^ min(fc,g) , ^ 

iy-x= y: nN^w^x= y: rM^ (2.7) 



t=0 ^ ' i=0 ^ 



where 



6,t = W'N'x. 
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(2.8) 



For q varying, q > k and ip & X fixed we have 

= mfi)m^Pm,2p ■ ■ ■m,kP ■ (2.9) 

The character ^p G X has the form ip : x i-^ e{{v,x)) for some v = (fi, . . . ,fm) G Z*^ 
where {v, x) means the dot product in M™, and hence the right-hand side of fl2.9p is 
e{Y{q)) where Y{q) is a polynomial in q with degree < k and coefficients depending on 
V and the ^'s. Changing variables from q to n hy n = vq + I with < / < z/ — 1, we see 
that Y{q) = (j){n) a polynomial in n with degree < k and coefficients depending on v, u, I 
and the i^'s. It follows that 



1=0 n<N 

n = l{ mod I/) 

Kn)e{<P{n)). (2.10) 



n<N 1=0 ri<N 

ri — i( mod I/) 



1=0 n<N 

n = l{ mod ly) 



Theorem 11.11 for {W,X) now follows from the following classical result proved by Dav- 
enport [5] for linear and by Hua [12] for (p nonlinear. This lemma will also be used in 
later sections. 



Lemma 2.1. Let u be a positive integer and < I < u. Let 

(f){u) = adU^ + a^-iu'^""^ -!-•■■ + aiu + ao 
he a real polynomial of degree d > Then, for arbitrary A> Q, 

Y « —4- (2.11) 

^ log N 

n = Z( mod i>) 

where the implied constant may depend on A and u, but is independent of any of the 
coefficients ad, . . . , ao- 

This can be established by Vinogradov's method or its modern variants, such as Vaughan's 
identity or Heath-Brown's identity. The estimate (12. lip with the congruence condition 
n = /(modi/), but with fi replaced by A the von Mangoldt function, was established in 
Hua [12], Theorem 10. 
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3. Theorem 11.21 with a rational 

3.1. Reduction. Without loss of generality we may assume that a = d = 1 m. (11. 4p . 
Thus 

T: (xi, X2) (xi + a, cxi + X2 + (3.1) 

where c G Z, a G M and /i is a smooth periodic function of period 1. Since the linear 
combinations of characters ^/^ G are dense in C(T^), it is sufficient to show that 

^ ^l{n)^l){T''x) = o{N), as N ^ 00 

n<N 

for any fixed a; G X and any fixed G T'^. Note that any G has the form tjj : x t-^ 
e{{b,x)) for some b = (&i,&2) ^ where {b,x) means the dot product in M^. Applying 
(13. ip repeatedly, we have T" : {xi,X2) {yi{n) , y2{n)) with 

yi{n) = Xi + na, (3.2) 
n(n — 1) 

2/2('T-) = c ^ a + cnxi + X2 + h[xi + ja). (3.3) 

i=o 

It follows that 

n— 1 

(6, y{n)) = biyi{n) + 62l/2(«) = P{n) + &2 ^ + j"), 

i=o 

where 

/ Tlifi 1) \ 

P{n) = bi{xi + no) + 62 I c a + cnxi + X2 I , (3.4) 



a polynomial of n with degree at most 2 and with coefficients depending on a,a;i,c, and 
b. Put 

5(iV) = 5^^(n)e((6,y(n))) 

n<Af 

= ^/i(n)erP(n) + 62X1^(^1 +-^"))- (3-5) 

n<N ^ jr=0 ^ 

Then the aim is to prove that 

S{N) = o{N) (3.6) 

for any fixed x = {xi,X2) G and any fixed b = (61, ^2) ^ which will be done in 
§§3-6. We may suppose that 62 7^ since otherwise (13.60 follows from Lemma 12.11 with 
u = I = 1 immediately. 
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Some of our results in §§3-6 actually hold for any smooth periodic h, not necessarily 
analytic. Suppose that /i : M — > M is a smooth periodic function with period 1. Then it 
has the Fourier expansion 

h{x) = h{m)e{mx), (3.7) 

which converges absolutely and uniformly on M, and its coefficients h{m) satisfy 

/i(m) <^ (|m| + 2)-^ (3.8) 

for arbitrary A > 0. We can transform S{N) by inserting the Fourier expansion (13. 7p of 
h. Thus, 

n— 1 n— 1 

/;,(xi + ja) = h{m)e{mxi) e{jma) 

j=0 mgZ j=0 

e{nma) — 1 



= h{'m)e{mxi)- 
e{nma) — 1 



eima) — i 

where we understand that 

: n for ma G Z. (3.9) 



e{ma) — 1 

This can happen only when a is rational. It follows that 

S{N) = y fi{n)e(p{n) + 62 V Mm)e(mxi) ''^'''""^ ~ . (3.10) 

3.2. The case of rational a. In this section we establish (13. 6 p for rational a. 

Proposition 3.1. Let S{N) be as in Ii3.5\) . and /i : M — )■ M a smooth periodic function 
with period 1. If a = a/q E Q, then 

S{N) <t:N\og-^N, (3.11) 

where A > is arbitrary, and the implied constant depends on A only. 

Proof By (KTO^ and flMjl . 



n<N 



S{N) = fi{n) i P{n) + h{qm)e{qmxi 



The last series over m is absolutely convergent, and its sum is a constant (3 depending on 
xi as well as a; = a/q. Hence 

S{N) = J2 Kn)e{P{n) + h^n), 

n<N 



from which and Lemma 12.11 the proposition follows. 



□ 



4. The continued fraction expansion of a 

4.1. The continued fraction expansion of a. From now on we assume that a is 
irrational, and our argument will depend on the continued fraction expansion of a. Every 
real number a has its continued fraction representation 

a = ao H ^— j — (4.1) 

where ao = [a] is the integral part of a, and ai, 02, . . . are positive integers. The expression 
(14. ip is infinite since a ^ Q. We write [oq; Oi, 02, . . .] for the expression on the right-hand 
side of (14. ip . which is the limit of the finite continued expressions 

[ao;ai,a2,...,afc] = aoH ■ — ^-^ (4.2) 

cti + 

as — 7- 00. Writing 



< , (4.3) 



— — [o-o! C^l) 02, ... , Ofc], 

we have Iq = Oq, h = ^oai + 1, go = Qi = di, and for k >2, 

h = (J'kh-l + 4-2, Q'fc = O.kQk-1 + Q'A:-2- 

Since a is irrational we have g^+i > q'fc + 1 for all > 1. An induction argument gives 
the stronger assertion that > 2^^^"^)/^ for all k > 2, and thus increases at least like 
an exponential function of k. The irrationality of a also implies that, for all k > 2, 

1 Ik 
< a 

'^QkQk+l Qk 

which will be used in our later argument. 

Let Q be the set of all qk with k = 0,1,2,...; note that qo = 1. Sometimes it is 
convenient to abbreviate qk to q, and g^+i to q~^. Let B he a large positive constant to be 
decided later. The set Q can be partitioned as U where 

Q' = {qeQ:q+ < g^}, Q» = {g G Q : g+ > g^}. (4.4) 

Lemma 4.1. Let /i : R — )■ M 5e a smooth periodic function of period 1. Then the following 
two series 

\Hm)\ \Hm)\ ^^^^ 

^—i ^ ll'^all ' ^ ^ ll^«ll 

g€Q g<m<g+ gCQ^ q<m<q + 

are convergent. 
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Proof. We first establish the convergence of the first series in (14 .SI) . By fl4.3p . a can be 
written in the form 

« = - + . ' V g e Q, (/,g) = 1, |7| < 1. 
q q{q+l) 

Therefore, for m = 1, 2, . . . , g — 1, 

ml m'-f ml 7' 
g q{q + l) q q+l 

and hence 

v ^ = V t 

^ llmall ^ 11^^^ + -i^ll 

m=l " " m=l II g q+l " 

We write m/ = r(modg) with 1 < |r| < g/2, so that the last denominator is 
y |r| |7'| _ (g + l)|r| - g|7'| ^ |r| 



q q + l + 1) "^IfJ+l) 

and consequently 

-2-1 



m=l II II l<r<q/2 



It follows that, for any positive t, 

V < f - + 1) 9' log g. (4.6) 



, ma vg 

m<t II II ^ 

gfm 

By (13.81) and partial integration, 

(j<m<(j+ I3<m<i3+ n n V ^ m<t 

<g^logg^ + l^rft < g"^+Mogg, 

and hence the first series in (14. 5 p is convergent. 

Next we consider the second series in (14.51) . Again by (14. 3p . 



m 

< 



2gg^ 



ma — m- 
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m 

< — - 



(IT 



Since g|m, we may write m = m'q, and hence the above becomes 



m' „ ,, m' 



< \\ma\\ < — -. 



2q+ " " g^ 

It follows that 



E E ^«?+logg+, (4.7) 



, ma — ' rri' 

q<m<q+ 'n^' I^Q'^ / Q 

q\m 



and the last term is <^ g log(g ) since q G Q . From this and f l3.8p we deduce that 

\h{m)\ 



q\m 



which proves that second series in (14.51) is also convergent. The lemma is proved. □ 

4.2. Transformation of the sum S{N). Lemma 14.11 can be used to understand the 
sum over m in (13.101) : it implies that the following two series 

y /imemxi / ^ \ 4.8 
^ ^ ^ ^ ^e(ma) - 1 

q<m<q+ ^ ' 

q\m. 

and 

J2 hMeimx,)^^^ (4.9) 

qGQ^ q<m<q + 
q I m 

are absolutely convergent. Denote by g{na + Xi) the sum of these two series, that is 

e(?T,ma) 



g{na + xi) = lY E + E E [M"^)e("^a;i)- 



^GQ q<m<q+ ^GQ'' g<m<g + 

qfrn q|m 



e(ma) — 1 

where (7 : M — t- M is a smooth periodic functions of period 1. It follows that 

(E E + EE (^^'^^''^'^''^^ ''e'ima) - 1 ^ ^^""^ ^ ~ ^^"^'^ 

g<m<q+ (JGQ'' q<m<q+ ^ 

gf m g | m 

Therefore the sum over m in (13.101) can be written as 



g{xi + na) - g{xi) + H{x) 
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(4.10) 



with 



H{x) = h{m)e{mxi 



eynma] 



q\m 



e{ma) — 1 



(4.11) 



Inserting these into fl3.10p . we have 

S{N) = e{-b2g{xi)) ^ /i(n)e{62^(na + Xi) + P{n) + h2H{n)} 



n<N 



with P as in fM . 

In the following we shall prove that the factor e{h2g{na + xi)} can be removed by 
Fourier analysis, and is hence harmless. Since : R — t- M is a smooth periodic function of 
period 1, we have the Fourier expansion 



e{h2g{u)) = a{m)e{ 



mu] 



(4.12) 



where 



aim) 



e{h2g{u))e{—mu)du. 



Since g{u) depends on the constant B in (14. 4p . we see that a{m) depends on 62 and B 
only. The series (I4.12p converges absolutely and uniformly in u G M, and hence 



'S'(A^) < ^{n)e{P{n) + b2H{n)} a{m)e{mxi + mna) 

n<N meZ 

— fi{n)e{b2H{n) + P{n) + mna} 



n<N 



<^ sup 



fi{n)e{b2H{n) + P{n) + mna} 



n<N 



(4.13) 



where the implied constant depends only on 62 and the constant B in (14. 4p . The polyno- 
mial P(n) + mna is harmless, but the complexity comes from H{n) which we deal with 
in the following subsections. 



4.3. Theorem 11.21 with a irrational. To estimate the right-hand side of (14.130 . we 
rewrite the function H in (14. lip as 

Hin) = J2 Fin; q) 



gee 
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^/ N f/ N / .e(nma) — l 

F(n;q)= > him)e{mxi)—. . (4.14) 

^ p mm] — 1 



g<m<g + 
q|m 



where 

F(n:n)= \ him^eim.xA' 

e{ma) — 1 

We want to truncate H{n) at Y, where Y is to be decided a httle later. Apphcation of 
(11 .yp gives, for \m\ > Y, 

h{m)e{mxi)—. < e ^^A^, 

e[ma) — 1 

and therefore 

= F{n; q) + 0(e~"^A^) =: F{n) + ©(e^'^iV). (4.15) 

geett 

If we set 

y = -logAr, (4.16) 

r 

then the last 0-term in (I4.15P is ^ A^~^, and hence (I4.13P becomes 

5(Ar) < l + sup|T(Ar)|, (4.17) 

where we should remember the implied constant depends only on 62 and 5, and where 
we have written 

T{N) = i2{n)e{b2F{n) + P{n) + mna}. (4.18) 

n<N 

Thus the estimation of S{N) reduces to that of T{N). 

Further analysis on F{n) is necessary. Recall that > q^ for any g G Q^. Also for 
any g G Q^, we have by (14. 3 p that 



m 

< 



ml 

ma 



m 
qq+ 



If it happens that q\m, we change variables m = qm' so that the above becomes 



m' „ , „ m' 



< \\m'qa\\ < — . (4.19) 



2q+ " " g^ 
For further analysis we write 

gt) = {mi,m2, . . .}. 

Noting that 

rrti < mf < < m2 < m,^ < m^ < . . . , (4.20) 
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we deduce that > mf > {mf)^ = mf^, and consequently 

m+ > (4.21) 

for all j > 1. The sequence fl4.20p should also be truncated at Y. Since mj — oo as 
j — cxD, there exists a positive integer J such that 

mj<Y < mj+i. (4.22) 

From this and fl4.2ip . we can bound J from above as 

IoSeSt log log log at 

J < . IT' + 1 < „ 4.23 
log B log iJ 

where we have used the definition of Y in (14.161) and therefore the implied constant 

depends on r. If we write q = mj in (I4.19P and change variables as m = m'mj, then 

, e{nm'mja) — 1 

where Mj := / nij for j = l,...,J— 1, but 

Mj := Y/mj. (4.25) 

In (I4.24P we have 

m' , m' 

< \\mmja\\ < — (4.26) 



Fjin) := Fimmj) = > himjm^eimjm Xi) — ; — (4.24) 

l<\m'\<Mj ^ ■' ' 



2m^ " '' " 



and if we write Qj = WrrijaW then the above with m' = 1 gives 



< ^7 < ^ (4.27) 

2mJ mJ 

for all j > 1. Hence (14.241) can be written as 

F,(n) = /(n%), (4.28) 

with 

fj{^)= z2 h{^j^)<^{'^j'^^i) ^5 xe[6j,6jN]. (4.29) 

We conclude that the function F{n) in (I4.18P is of the form 

F{n) = h{ne^) + ---fj{nej). (4.30) 
This is the expression from which we start to handle the factor e{b2F{n)) in (14.181) . 
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With fj as in fOOD we set 

$j = ^ \m\'^\h{mjm)\. (4.31) 

l<\m\<Mj 

Let C > be a large constant to be specified at the end of §6. We need to consider three 
possibihties separately: 

(A) m+$j < hg^^ N; 

(B) (m+)3 > ^jiVMog^AT; 

(C) m+<l>j > \og^^ N and [m+f < <^jN^log'^ N. 

In cases (A) and (B), the factor e{b2F{n)) will be handled by Fourier analysis and 
Lemma 12.11 while in case (C) by a finite version of the Vinogradov method (Bourgain- 
Sarnak-Ziegler |3j), as well as Poisson summation and stationary phase. 

4.4. Theorem 11.21 with a irrational: case (A). In this subsection we prove the fol- 
lowing proposition. 

Proposition 4.2. Let S{N) be as in Ii3.5\) . and h an analytic function whose Fourier 
coefficients satisfy the upper bound condition ( [i. ?[ ). Assume condition (A). Then 

S{N) < N{\ogNf^+^-^, (4.32) 

where A > is arbitrary, and the implied constant depends on A,t, and 62; but uniform 
in all the other parameters. 

We remark that the lower bound condition fll.81) is not needed in Proposition 14.21 

Proof. It suffices to bound T{N) defined as in fl4.18p under the condition (A). Our analysis 
starts from fi. Recall that 

f,{x)= V /i(mim)e(mimxi)4^^^T— T' xe[e,,e,N]. (4.33) 
^ — ^ e mfc'i — 1 

l<|m|<Afi ^ ' 

It is easy to compute the first and second derivatives of /i, that is 

f[{x) = 2Tci mh{mmi)e{mmiXi)— — ^— xG[6i,6iN], 

l<\m\<Mi ^ ^' 

and 

f'({x) = (27ri)^ 2J m^h{mmi)e{mmixi) ^ ^ , xe[6i,9iN]. 

l<\m\<Mi 

Trivially we have 



^ l<\m\<Mi ^ ^ 
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where the imphed constants are absolute. Note that 6(62/1(3^)) is a smooth periodic 
function on M, and hence can be expanded into Fourier series 



6(62/1(2;)) = y^^a{k)e{kx), 



where 



a{k) 



e{b2fi{x))e{—kx)dx. 



(4.34) 



(4.35) 



We must compute the dependence of a{k) on fi and 62- By partial integration we have 



a{k) 



2Txik 



e{b2fi{x))de{—kx) 



62 
k 

b2 



/ e{h2h{x))f[{x)e{-kx)dx 



Since 



27riA;2 
d{e{b2h{x))f[{x)} 



' d{e{h2h{x))f[{x)} 
dx 



e{—kx)dx. 



dx 



eib2Mx)){f^ix) + 2nb2f[ix)f[ix)}\ 
2 



$1 



< + 2nb2 



01 



$1 



01 



we can bound a{k) as follows 



Hk)\ < { |i 



$1 
01 



b^ 
"2 



for k ^ 0. Obviously for A; = we have \a 

'$1 

\a[K)\ ^ i + I 

feez 



\k\' 

< 1. It follows that 

2n 



$1 



< (462)'(^1 + ^J < (862)^(1 + m+$i)^ 

where in the last step we have applied J2\k\>i 1^1^^ < 4 as well as f l4.27p . 
Now we can remove the factor e{b2fi{n0i)) from any sum of the form 

J2f^in)eib2fiin0,) + Gin)) 



n<N 
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(4.36) 



(4.37) 



where G{n) is a function of n. Indeed, on inserting the Fourier expansion of 6(62/1(3^)), 
the above sum in absolute value can be written as 



jj,{n)e{G{n)) a{ki)e{nki9i 



n<N 



kit 



< ^ |a(A;i)| ^/i(n)e(nA;i^i + G(r2)) 



< (862)^(1 + sup 

fci,6»i 



^ I2{n)e{nkiei + G{n)) 



n<N 



by fl4.37p . In this way the factor e{b2fi{n6i)) has been removed. Of course, the same 
argument applies to 6(62/2), • • • , e(62/j), and hence (14.181) becomes 



where 



E = sup 



\T{N)\ < sn. 



yu(n)e{?2(/i;i6'i + ■ ■ ■ + kj9j) + P{n) + mna} 

n<N 



(4.38) 



(4.39) 



with the sup taken over a, m,ki, . . . , kj, 61, ... , 6j, and where 

J 

i=i 



The sum E above can be estimated by Lemma 12. 

S < iVlog-^iV, 



(4.40) 



(4.41) 



where the implied constant depends on A, but independent of all the other parameters. 
To estimate 11 we need to compute mf ■ ■ ■mj_^. From (14.201) we deduce by induction 



that {nij 



< TTij^^ for j = 1, . . . , J — 1, and therefore 



< (m 



j-i) ■ 



By definition there is a constant K >1 depending on r such that the inequality $j < K 
holds for all j. Hence 



j-i 
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and this can be used to bound 11 as follows: 

n = (862)'^(l+m+$j)2 JJ(l+m+$,)2 

i=i 

By (1421]) we have 

2Jlog(1662-fs:) 



if B is sufficiently large in terms of K and 62, that is in terms of r and 62- It turns out 
that for this purpose the choice B = 4[log(1662-^)] is acceptable, where [x] denotes the 
integral part of x. Note that fnj_-^ < rrij < Y with Y as in (14. 161) . These together with 
condition (A) give 

n < (1 + m+$j)2m^ < (1 + m+$j)V^ < (log A^)^^+^ (4.42) 

with the implied constant depending on r only. This is the desired upper bound for 11. 
Inserting fl02|) and fOB back into fOHjl . we get 

T{N) < Ar(logAr)8^+5-^, 

where the implied constant depends only on A and r. From this and fl4.17p we conclude 
that 



S{N) <t:l + N(\ogN) 



8C+5-A 



with the implied constant depends on A, r, and 62, but uniform in all the other parameters. 
This completes the analysis of case (A). □ 



5. Theorem 11.21 with a irrational: case (B) 

In this section we handle case (B). Still, the lower bound condition (11. 8p is not needed 
in Proposition 15.11 

Proposition 5.1. Let S{N) be as in Ii3.5\) . and h an analytic function whose Fourier 
coefficients satisfy the upper bound condition ( [i. ?[ ). Assume condition (B). Then 

S{N) < A^ log-^ A^ + A^(log A^)^-^ (5.1) 

where A > zs arbitrary and the implied constant depends on A, t, and 62 only. 

Proof. It is sufficient to estimate T{N) defined as in (I4.18P under the condition (B). We 
can repeat the argument in case (A) but with J there replaced by J — 1. Thus in (I4.18P the 
factors 6(62/1), 6(62/2), • • • , 6(62/7-1) can be removed by repeated apphcation of Fourier 
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S* = sup 



analysis, but the factor e(62/j) remains in the summation in f l4.18p . Hence, instead of 
f l4.38p . we have in the present situation, 

|T(A^)|<S*n* (5.2) 

with 

fi{n)e{n{hei + ■■■ + kj.iOj.i) + hfjinOj) + P{n) + mna} , (5.3) 

n<N 

where the sup is taken over a, m,ki, . . . , 9i, . . . ,9 Also similar to (14. 40 p . 

j-i 

n* = (862)'^^^'^ + m+$,)^ (5.4) 

where we note that 11* does not have any factor involving the subscript J. Similar to 
(1132]), we have 

n* < < (5.5) 

provided that B is sufficiently large in terms of r and 62- 

The estimation of S* requires more detailed analysis. We should take advantage of the 
fact that now 6j is very small. We write fj in (I4.29P in the form 

n-l 

fj{nej)= ^ h{mjm)e{mjmxi)'Ye{jmej), (5.6) 

l<\m\<Mj j=0 

where recall that Mj = Y/rrij by (14.250 . Taylor's expansion gives 



k=0 



{2tx ijmOjY 

and therefore 



n— 1 2 /r) • Q \k 

E -0-^^) = E ^^^^^ + 0{m^e^jN% 



k\ 

j=0 k=0 j=0 



Hence fl5.6ll takes the new form 



fj{nej) = Co{Mj)n+^c,{Mj)ejn{n-l) 

+]:C2iMj)9^j{n - l)n{2n - 1) + 0{cs{Mj)9'jN^}, (5.7) 




where 



CkiM) = — — — y m^himjm)e{mjmxi) 



l<\m.\<M 
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for /c = 0, 1, 2, while 

C3(M) = \m\^\h{mjm)\. 

l<\m\<M 

Obviously c^i^Mj) < Put = Cfc(oo) for = 0,1,2. Then by the upper bound 

condition fll.7p . 

\ck-Ck{Mj)\ < J2 H''\h{mjm)\ <^ J] mV™^*" 

\m\>Mj m>Mj 

« e-^^«iv-^ 

and therefore Ck{Mj) = + 0{N~*) for /c = 0, 1,2. Collecting these estimates back to 
fl5.7l). we have 



&2/jMj) = + 0{\h\N-') + 0{\h\Y^je'jN^) 

with 

1 1 

Q{n) = cqu + -Ci6jn{n - 1) + -C26']{n - l)n{2n - 1). 

Inserting these back into (15. 3p yields 



S* ^ sup 



fi{n)e{n{ki9i H + fcj_i6'j_i) + h2Q{n) + P(n) + mna} 



n<Ar 

|3 Ar5N 



+0(|62 1) + 0(|62|>^<f j^^jiV' 

where the sup is taken over a, m,ki, . . . , kj^i, 6*1, ... , 6'j_i. 

The condition (B) is designed to control the last 0-term, which is ^ A^(log A^)^"*" with 
the implied constant depending on r and 62 only. Applying Lemma [27T] again to the above 
sum over n, we get 

S* < log-^ A^ + Ar(log Ny-^' 

where A > is arbitrary and the implied constant depends on A, r, and 62 only. The 
desired result now follows from this and (15. 5p . □ 

6. Theorem 11.21 with a irrational: case (C) 

6.1. The result and the idea of proof. In this section we treat case (C) by establishing 
the following result. 

Proposition 6.1. Let S{N) be as in Ii3.5\) . and h an analytic function whose Fourier 
coefficients satisfying both the upper bound condition ( [i. 7| j and the lower bound condition 
(E^j. Assume condition (C). Then 

S{N) = o{N). (6.1) 
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In view of f l4.17p . it is sufficient to establish (16. ip for T{N) with 

n) + mna} 



n<N 



as in f l4.18p . Here we recall that P{n) is the polynomial of degree at most 2 as in (13 .40 . 
and F{n) = h{ndi) + ■■■ fj{ndj) with 

e{xm) — 1 



mjm)e[mjmxi^ 



e{m6j 



l<\m\<Mj 

as in fl4.30p and f l4.29p respectively. The tool of our proof is the following result of 
Bourgain- S arnak- Z iegler [3] . 

Lemma 6.2. Let / : N — t- C with |/| < 1 and let v he a multiplicative function with \v\ < 
1. Let T > be a small parameter and assume that for all primes pi,p2 < e^^'^,Pi 7^ P2, 
we have that for M large enough 



fiPim)f{p2m) 



m<M 



< tM. 



Then for N large enough 



n<N 



< 2A/rlog -A^. 



(6.2) 



(6.3) 



Lemma [6.21 reduces the estimation of T{N) to that of 
f{N) = Y e{b2F{din) - b2F{d2n) + P{din) - P{d2n) + di nma — d2nma} (6.4) 

n<N 

where di 7^ d2 are positive integers. Without loss of generality we assume henceforth that 
di > d2- Noting that 

b2F{din)-b2F{d2n) = {62/1(^1^1^^1) - 62/1 (^2^^^!)} + ■■ ■ 

+{b2fAd,nej)-b2fjid2nej)}, 

we can repeat the argument in case (A) but with J there replaced by J —1. Thus in (16.40 
the factors 

6(62/1), e(-62/i), • • • , e(62/j-i), e(-62/j-i) 
can be removed by repeated application of Fourier analysis, but the factor 

e{b2fj{dinej)~b2fj{d2nej)} 

remains in the summation. Hence instead of (I4.38p we have in the present situation 

\f{N)\ < EH (6.5) 
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with new definitions of S and 11. In fact in the above 



sup 



^ e{n{dikiei H h dikj_i9j_i - c/2/16'1 d2lj-i9j-i] 



n<N 



+b2fj{din6j) - b2fj{d2n6j) + P{din) - P{d2n) + {di - d2)mna} 



, (6.6) 



where the sup is taken over a, m, di, d2, ki, . . . , kj_i, h, . . . , Ij-i, 9i, . . . , 6'j_i. Also similar 
to flO0|) . 

j-i 

Il={8b2)'^'-''>l[{l + m+^,)\ 
i=i 

where we note that 11 does not have any factor involving the subscript J. Similar argument 
gives 

n < < (6.7) 
provided that B is sufficiently large in terms of r and 62- 
To handle S, we write fj{x) for fj{dix) — fj{d2x) so that 

fj{x)= kmjm)e{mjmx,) <dimx) - e{d2mx) ^^^q^^q^^^^ (gg) 

where recall that Mj = Y/mj by definition. We want to estimate S by Poisson's sum- 
mation formula and the method of stationary phase. To this end, we need to know the 
derivatives of fj{x). We are going to use the third derivative of fj{x), which is 

-(3), , , .,3 3-, , , ^dfe^dimx) — d2e{d2mx) 



ff\x) = {2mf m^h{mmj)e 



mmjXij 



, , e(m6 j) — 1 

l<\-m\<Mj ^ •' ' 

the reason for using the third derivative will be explained later. Since Qj x we have 

I 1/1 1^1 1 
X — r < — 

Wuj mj 

for |m| < Mj, and hence 



e{m6 j) — 1 = 27cim6 j 

It follows that 
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where 

= m^h{mmj)e{m'mjXi){d\e{dirax) — ^26(^2^3;)}. (6.10) 

l<\m\<Mj 

The polynomial is too long for a stationary phase argument, however the upper and 
lower bound conditions (11. 7p and (11.81) enable us to cut 0(x) at some fixed integer D. We 
will show in the following subsection that the choice 

D = [t2/t]+2 (6.11) 

is acceptable, where [x] denotes the integral part of x. 

6.2. The polynomials cj) and (po- We denote by 0d the part of with |m| < D, that is 
'Poix) = rn^h{rarnj)e{rnmjXi){d\e{dimx) — d\e{d2mx)}, (6-12) 

l<|m|<D 

and we want to approximate by this (f)D- By the upper bound condition (II. 7p . the tail 

(p — (pD can be estimated as 

(j){x) — (j)D{x) <ti dl m^\h{mmj)\ 

m>D+l 

<^ dl ^'e-™'"^ < dle-^^""', (6.13) 

m>D+l 

where the implied constants depend at most on r. Next we are going to prove that, when 
X is away from the zeros of (poix) by a small quantity 6, |0£)(x)| is away from by some 
quantity depending on 6. 

Lemma 6.3. Let P{z) he a complex polynomial of degree n defined by 

P{z)=Co + CiZ + ---+CnZ'', (6.14) 

and let Zi, . . . , Zn be the zeros of P{z). Let 6 be a small real number, and around each zj 
make a disc Dj = {z : |z — Zj\ < 6} where j = 1,. . . ,n. Let T denote the unit circle. 
Then for any z G T\{U'^^iDj} we have 

\P(z)\ > ( -] IIPII2, 



where 



n 



^|c,„n • (6.15) 



• in=0 

We remark that T\{U jDj} is the unit circle with some open arcs removed, and some 
of the removed open arcs may not contain any zero of P{z). The total number of these 
removed open arcs is at most n. 

23 



Proof. Suppose that \zj\ < 2 for j = 1, . . . , /c, while > 2 for j = /c + 1, . . . , n. Then 
we can write P{z) = Pq{z)Pi{z) with 

k n 
Po{z)=Cnl[{z-Z,), P,iz)= Y[{z-Z,). 
j=0 j=k+l 

First we note that a lower bound for |-P(2;)| follows directly from the construction of 
T\{U]^^Dj}, that is 

\P{z)\ > cj'\Pi{z)\, z e T\{U]^,D^}. (6.16) 
Next we compute the norms of P and Pq, getting 



0||2 



|Po(e(x))prfx = [\lT[\eix) - z,\Hx < c^S^'^ 
Jo Jo j-^o 



and 



||P||^ = f \P{e{x))\Mx<r^^M{z)\' f \P,{e{x))\Hx 
Jo ^eTT Jq 

< 432'=max|Pi(2)|l 
The last inequality combined with (16.161) gives 

Suppose max |Pi(2;)| is achieved at 2 = C G T. Then for any 2; G T we have 



max|Pi(2)| J-1 \C-zA - J-l \zA + l- 



k—n 



The desired result finally follows from this and (I6.17p . □ 
We want to apply the above lemma to (pD- Dividing ipD by e{—diDx), we have 

e{diDx)(j)D{x) = 0D,i(a;) - <I)d,2{x), (6.18) 

where, for i = 1,2, 

D 

<pD,i{x) = m^h{mmj)e{mm,jXi)e{dtmx + diDx). (6.19) 

m=-D 

Recall that we have assumed d\> d2. The norm of (pD,^ can be computed as 

110^,^2 = d^^ 
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with 

D 



^ \m\^\h{mmj)\A\ (6.20) 



-m=-D 

and therefore, by f l6.18p and the triangle inequahty, 

||0d||2 = - 0D,2||2 > ||0D,l||2 - ||0D,2||2 

= {dl - > $. (6.21) 

If we write z = e{x), then z lives on T and e{diDx)(j)£,{x) can be written as a polynomial, 
say P{z), in z with degree 2diD. An application of Lemma [6.31 to P{z) asserts that 

\Pi^)\>[^j \\Ph, zeT\{U]^,D,}, (6.22) 

where ||-P||2 is defined as in fl6.15p . Obviously ||-P||2 = ||0d||2- 

Under the map x ^ z = e{x), the pre-image of 2; G T fl {Wj^iDj} is a union of small 
intervals 

e<L 

where L < deg(P) = 2diD. Note that each Ii has length at most 26. It follows from 
flOTjl and IK2Tf that, for x G (0, l]\{Ui<Ll£}, 

\M^)\ > (3 j ^- 

Obviously $ > |/i(mj)|, which together with ( ]6.13p gives 
1 1 f6\ ^'^^^ 

-\M^)\ - |0(x) - M^)\ > o o - d\e'^'''^^. (6.23) 



2'" ' ' " ' " - 2 V3^ 

The lower bound condition (11. 8p implies that \h{mj)\ ^ e^'^^"*-', and hence the right-hand 
side of (I6.23P is positive provided that 

dl<=^r-\ e^-^-^^)^-^. (6.24) 

In view of (16. lip and (I4.22p . the exponent {tD — T2)mj approaches infinity when N — )■ 00. 
Suppose that (I6.24p is satisfied. Then, for x G (0, l]\{U£<Lle}, 

|0(x)| > ^\M^)\ + Q|0z,(x)| - |0(x) - M^)\] > if^V'^'^'f- (6.25) 
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Next we formulate (16.25p in terms of the function (j){x9j) with x E {0,9j^]. Write 
x9j = ^ and 

Je = 0-/h, (6.26) 

that is each is an amphfication of U by 6^^. Note that the length of each is < 26j^5. 
Hence flOSj) implies that, for x e (0, 6'7^]\{U^<l JJ, 



1 /5 



2diD 

mxej)\>-\^-j <f. (6.27) 

This will be used in the following subsection. 

An upper bound of \(f){x9j)\ will also be necessary. Since the right-hand side of (16.231) 
is positive, we have 

\m\ < \Mx)\ + \m - Mx)\ < l\Mx)\- (6.28) 

Also, by (16.121) and Cauchy's inequality, 

|0d(x)| < 2dl \m\'^\h{mmj)\ < 2rf?(2D)^<l>. 

\m\<D 

Combining the above two inequalities, we get 

\(l){x9j)\ <6dlD^^ (6.29) 
for any real x. The above upper bound will also be used in the following subsection. 

6.3. Application of Poisson's summation and stationary phase. In this subsection 
we estimate S in (16. 6 p by Poisson's summation formula and stationary phase. The follow- 
ing lemma of van der Corput (see for example Iwaniec and Kowalski |T3], Corollary 8.19), 
in particular, will be applied. 

Lemma 6.4. Let b — a>l. Let F{x) be a real function on {a,b) such that 

A < \F'^^\x)\ < r]A (6.30) 
for some A > and f] > 1. Then 

e(F(n)) < 7]^A^b - a) + A'^b - a)^, 

a<n<b 

where the implied constant is absolute. 

Proof of Proposition \6.1[ The sum S in (16.61) can be written as 



S = sup 



n<N 
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(6.31) 



with 

E{x) = x{diki6i H h dikj_i6j_i - ^2/1^1 6/2/7-16' j_i) 

+b2fj{x9j) + P{dix) — P{d2x) + (c/i — d2)mxa, 

where the sup is taken over a, m, di, ^2, ki, . . . , /cj-i, h, ■ ■ ■ , /j-i, 9i, . . . , 9j_i. If we take 
the third derivative of E{x), then all the quadratic and linear terms in E{x) will be killed, 
and the argument will be clearer. This is the reason for taking the third derivative of 
E{x). Thus dSH) and ([S3D imply that 

E^'\x) = h2ff\xej)e] = -{2'nejfh2[i + o(^-^^^0(x^j). (6.32) 

Recall that in case (C) we have mj $j > log^*" A^. We need to handle the following two 
possibilities separately: 

(CI) < N- 
(C2) m+ > N. 

Case (CI). In this case we will first conduct our analysis on the subinterval (0,6*^^] C 
[1, N]. The set (0, 6^^]\{Ui<LJi\ consists of at most L + 1 intervals, and we suppose (a, h) 
is any one of them. On this interval (a, h) we apply fl6.27p and (16.291) to get 

pe]^ < l^(^) [x) I < d\e]^ (6.33) 

with 

(3 = (5/3)2''i^, (6.34) 

where the implied constants depend on &2 and D only. This means we can take A = /36'}$ 
and r] = P~^dl in Lemma [6.41 which implies that 

e(E(r2)) </3-^c/2(^2^)i(6-a) + (/3e2^)-^(6-a)5 + l, 

ng(a,fe) 

where we have added a 1 on the right-hand side to cover the case b — a < 1. Summing over 
all these possible intervals (a, b) C (0, Oj^]\{Ue<LJi}, which are at most L + 1 < 2diD + 1 
in number, we get 

J2 eiE{n)) <^ (3-~'dl{9j<!>)kj^ + di{(3ej^)--e9f' +d,, (6.35) 
"6(o,eji]\{uja 

where the implied constants depend on 62 and D only. The length of each interval Jg is 
<^ 9j^6 by (I6.26p . and hence trivially 

J2 e{E{n)) « ej'6. 

27 



The number L of these intervals is at most 2diD, and consequently 



J2 <E{n)) < die-/5, 



which together with fl6.35p yields 



J 



di + diQ-j^S 



(6.36) 



with the implied constant depending on 62 and D only. 

Now we come to the estimation of S. Since E{n) is a periodic function with period 
we cut the interval [l,iV] into two smaller ones \i^9^^K] U (6'J"'^i^r, A^], where K = [9jN] 
and [x] means the integral part of x. The main interval and the tail interval 

{6j^K, N] must be treated differently, and therefore we split fl6.3ip as 



S < So + Si 



(6.37) 



where 



So = sup 



Si = sup 



ne{ej'^K,N] 



with the sup having the same meaning as in 06.311) . The main interval [l,9j'^K] consists 
exactly of K periods of E{n), and (16.361) can be applied to give 



ne[l,dj^K] ne(O,07^] 

< l3-hl{e'^j^)^N + di{/3e]^)-k]N + diOjN + di5N. 



(6.38) 



The first term on the right-hand side of (16.381) is bounded from above by <^ dl/3~36jN 

_i 1 _i 

with the implied constant depending on r only, and the second by diP 6 6'j$ 6A^, which 
also dominates the third term. Therefore the third term can be erased. 

We want to replace the $ by $j in the second term of (16.381) . The definitions (16.201) 
and (I4.3ip trivially imply 



$2 < 



l<|m|<A/j 



\m\^\h{mjm)\'^ < $j. 
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In the other direction, we have 

$^ < 2Mj ^ \m\^\h{mjm)\'^ 

l<\m\<Mj 

< Mj ^ |m|^|/i(mjm)p = Mj$2 

l<lm|<i? 

by Cauchy's inequahty as well as an argument similar to (I6.28P to compare the above 
longer sum up to Mj with the shorter one up to Z^. Therefore we can substitute the $ in 
the second term of (16.381) by $j, getting 

So < d\l3~-^ + dif3~^ , + di5N (6.39) 

(mj)3 (mj$j)6 

where the implied constant depends on b2,T,T2 only. We multiply 11 with Sq, and then 
apply the bound (16. 7p to get 

~ ~ 1 NY^ 1 NY-^'^ 

Son < dlr-'-—r + d,(3- \ ^ . + d^nv>j5N, 
{mj):i (mj$j)6 

where the implied constant depends on 62,^, T2 only. It should be remarked that to the 
last term on the right-hand side above, the bound 11 < mPj has been used instead of the 
crude bound 11 < y^. 
Now we specify 

S = 3m7^° (6.40) 

so that (16.341) implies that 
and hence 

Son « \ , + \ , + (6.42) 

(mj)3 (mj$j)6 mj 

where the implied constant depends on T2 only. 

We must check that our choice of 5 in (16.401) makes the inequality (I6.24p meaningful, 
that is (I6.24P does not confine di to a finite interval. This can be seen easily from the fact 
that, under (16.400 . the right-hand side of (I6.24p equals 

^(TD-T2)mj 

which clearly approaches infinity as mj — )■ 00, that is as iV — )■ 00. 
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For fixed di we take C = 20diD + 20. Then the assumption mj ^ mj$j > log^'^iV 
implies that the right-hand side in f l6.42p approaches as — i- oo. 

Sofl = o{N) (6.43) 

as — 7- oo. This completes our analysis concerning the main sum Sq. 
To bound the tail sum Si we note that 

J2 e{E{n))= J2 ^(^W) (6.44) 

by periodicity of E[n). The length of last sum is N — 6J^K < 6J^, and therefore it is 
covered by case (C2). The argument in case (C2) below will give 

Sifl = o{N) (6.45) 

as — )■ oo. 

We conclude from ([63]), (lOTI) . (ICT]) . and (16:451) that 

|f(A^)| < Sofl + Sin = o(Ar), (6.46) 

which in turn proves that T{N) = o{N) by Lemma 16. 2[ The desired result for ^(A^) 
follows from (14.171) . and this finishes the analysis in case (CI). 

Case (C2). This is similar to the proof of (I6.43P in case (CI), and only minor modi- 
fications are necessary. In the present situation we start the analysis on [1, A^] directly, 
instead of on {l,6j]. Thus (16.361) takes the form 

e{E{n)) < l3-hl{e'^j^)^N + di{l3ej<!))-^ m + di + diN5. 

n<N 

As before we multiply by H, apply the bound (16. 7p . replace $ by $j, and take S as in 
(I6.40p . Then we have (16.411) . and hence instead of (16.420 we have 

|f(A^)| < sn 

« ^---^ + ^^^ 1 + (6.47) 

(m+)3 $6 mj 

For fixed di we take C = 20(iiZ} + 20 as before. The first and third terms on the right-hand 
side are the same as in ( 16.42p . and can be handled in the same way. To bound the second 
term on the right-hand side of (16.470 . we write 

(mt)5 1 (mt)i 



$6 (m)$j) 24 ^8 
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To the term (mj $ j) 24 in the denominator we apply the first assumption mj $ j > log^*" N 
in case (C), while to the term {mj)s in the numerator we use the second assumption 



+^3 < $ jN^ log'^ N in case (C). Thus (lOHll becomes 



,.1 , c 



(mt)3 1 <l>*,A^2log8A^ ^1, c 
^ V < — g < log~^ A^. 

$6 loge iV $j 

This ensures that the second term on the right-hand side of (16.471) approaches as — 00. 
This proves that T{N) = o{N) and hence T{N) = o{N) by Lemma 16.21 again. This 
completes the analysis in case (C2). 

Proposition 16.11 is finally proved. □ 

Proof of Theorem li.E Theorem 11.21 follows from Propositions 13. H 14. 2[ 15. H and 16.11 □ 

7. DiSJOINTNESS OF FROM FURSTENBERG'S SYSTEM 

7.1. Furstenberg's example. Furstenberg gave an example of smooth transformation 
T : — > such that the ergodic averages do not all exist. Let a be as in §4.1 such that 

qk+i-e^"'' (7.1) 

with T as in (II. 7p . Define q-k = qu and set 

Kx) = Y."-^^^4^<<i^-)- (7-2) 

It follows from (14. 3 p and (17. ip that h{x) is a smooth function. We also have h{x) = 
g{x + a) — g{x) where 

SO that g{x) G L'^{0, 1) and in particular defines and measurable function. But g{x) cannot 
correspond to a continuous function, as shown in Furstenberg [6j. 

7.2. The Mobius function is disjoint from the Furstenberg example. It is enough 
to prove that a smooth conjugation of Furstenberg's dynamical system above satisfies the 
conditions of Theorem 11.21 To this end we introduce another function 

H{x) = 'YH{m)e{mx), (7.4) 

where 

H{m) = e-2"'™l. (7.5) 
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Obviously H{x) = G{x + a) — G{x) where 



We claim that G{x) is smooth, and this can be proved by the the argument in Lemma HTTl 
In fact by (14. 6p for any positive t, 



5^^^^ (- + l)g^loggfc, 



and hence partial integration yields 



^-^ \\ma\\ /„, I ^-^ \\ma\\ I 

5j.<m<g^_,_j^ II II yfc m<t H H ■' 

POD 

•J ai- 



On the other hand, by (14.71) . 



^-^ I mall 



ifc<'"<'jfe+i 

9fcl™ 



which together with f l7.ip gives 



< e-2-5'=gfc+i loggfc+i < e'^^'^g^. 

I mall 



'jfe<'"<'jfe+i 
■jfci™ 

These prove that the series in (17. 6p is absolutely convergent, and hence G{x) is continuous. 
In the same way we can prove that G{x) is even smooth. 
Now we add h to H so that h + H is smooth, and also 

h{x) + H{x) = {g{x + a) + G{x + a)} - {g{x) + G{x)}. (7.7) 

However g{x) + G{x) cannot be a continuous function, since G{x) is while g{x) is not. 

In the following we want to check that h{x) + H{x) satisfies the upper bound and lower 
bound conditions (11.70 and (ll.Sp of our Theorem 11.21 The m-th Fourier coefficient of 
h + H is 



k 



H{m) + ^— ^p^, if m = g^- 
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The case m qk is obvious. To check the case m = qk, apply (14.31) and (17. ip to get 

|1 - e{qka)\ _ 1 _ 1 
k kqk+i ke^'ik ' 

which in combination with (17. 5p yields 

^^^^ k ke^i" 

Thus the Fourier coefficients oi h + H satisfy (II. 7p and (11.80 . and therefore Theorem 11.21 
states that the Mobius function is disjoint from the flow defined hj h + H. 



8. Theorem O 

For a review of preliminaries of nilmanifolds, the reader is referred to the Appendix §9. 

8.1. Structure of afRne linear maps. We begin with the structure of affine linear 
maps. By §2.4 in particular Theorem 2.12 in Dani [Ij, any affine linear map T of G/T 
can be written as 

T = TgOa (8.1) 

where Tg is the action of (7 G G on G/T, a is an automorphism of G such that cr(r) = F, 
and a : G/T — )■ G/T satisfies a{xT) = a{x)T. It follows that 

T{xT)=Tg{W{xT)} = ga{x)T, 

and by induction 

T'^{xT) = ga{g)---a^-\g)a^{x)T. (8.2) 

We remark that (18. 2p itself is not enough to give a proof of Theorem II. 31 since the number 
of factors on the right-hand side of (18. 2p depends on n. 

8.2. Application of zero entropy. To prove Theorem II. 3[ we need the fact that the 
flow ^ = (T, X) has zero entropy. The main reference concerning the dynamics here is 
Dani's review article [4], Chapter 10. In this setting the flow has zero entropy if and only 
if it is quasi-unipotent. So the aim is to prove Theorem 11.31 for such flows. 

We need some words to clarify the definition. Let T = T^, o a be as in (18. ip . If all 
the eigenvalues of the differential da : q q are of absolute value 1, then we say that T 
and a are quasi-unipotent according to §2.4 in Dani [1]; this holds if and only if all the 
eigenvalues are roots of unity. Further, when G is simply connected, the factor of a on 
G/[G,G] is a linear automorphism and the proceeding condition holds if and only if all 
the eigenvalues of the factor are roots of unity. 
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Let X = {Xi, . . . ,Xr} be a basis for the Lie algebra g, and for a; G G let ipe^pi^) = 
. . . , Ur) be the coordinates of the first kind. Then a{x) can be computed by applying 
( 19. ip in the Appendix as follows: 

a{x) = (T{exp{uiXi + ■ ■ ■ + UrXr)} 

= exp{{d(T){uiXi + ■ ■ ■ + UrXr)}. 

Since da is quasi-unipotent, we may assume that the matrix U of da under X is quasi- 
unipotent, and hence 

{da){uiXi + ■ • • + UrXr) = {Xi, . . . ,Xr)Uu, 

where u denotes the transpose of the row vector {ui, . . . , Ur). It follows that 

{da)''{UiXi + ■■■ + UrXr) = (Xi, . . . , XrWu, 

and therefore 

cr"(a;) = exp{(c/o-)"(uiXi H h UrXr)} 

= exp{(Xi,...,X,)f/'^M}. (8.3) 

Since U is quasi-unipotent, U is a triangular matrix with its diagonal entries being 
roots of unity. It follows that there is a positive integer u such that 

= I + N (8.4) 

where / is the identity matrix and is nilpotent. From now on we let u denote the 
least positive integer such that (18.41) holds. For any n, we can write n = qu + I with 
< I < u — 1, and therefore we can compute U"' as 

min(q,r— 1) 

/ n \ 

It follows that 

U^u = y (8.5) 

where y denotes the transpose of the row vector {yniig), ■ ■ ■ ^ynril)) and each ynkil) is a 
polynomial in q with coefficients depending on U,x,i', and 1. Of course degynk <r — l 
for aA\ k = 1, . . . ,r. Inserting ( 18. Sp back to ( 18. 3p . we have 

a"(x) = exp{yni{q)Xi + ■■■ + ?/„,(g)X,}, (8.6) 

or, in the notation of ^ipexp, 

V^exp(cr"(x)) = (y„i(g), . . .,ynr{q))- (8.7) 

Similar results holds for ipexp{(^'' (g)) with g E G and j = 1, . . . , n — 1, that is 

V'exp((7^(^)) = iVjiiq), ■ ■ •,%>(?)) (8.8) 
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min(q,r-l) , , 

= u-'Q+i = u\i + xy = J2 



where each t/jkiq) is a polynomial in q with degree < r — 1 and with coefficients depending 
on f/, (7, u, and /. In the special case j = the above just reduces to the coordinates ipcxpio) 
of g. 

Now we apply Lemma 19.21 in the Appendix n times, so that the above analysis gives 

^^Je.A9^{9) ■ ■ ■ a"-^(^7)a"(a;)} = . . . , ^(g)) 

where Yi(g), . . . , Yr{q) are real polynomials in q with bounded degrees (which are actu- 
ally Or(l) with the 0-constant uniform in other parameters) and with their coefficients 
depending on U,x,g,h', and /. 

By Lemma 19.11 in the Appendix we can transform the coordinates of the ffist kind to 
those for the second kind. Apply ^|J o ip^J^ to the above equality, 

n9cr{g)---a-\g)a-{x)} = (V^ o ^."^(yKg), . . . , ^(g)) 

= {Zi{q),. . . ,Zr{q)), 

or 

ga{g) ■ ■ ■ a^-\g)a^{x) = exp{Z,{q)X,} ■ ■ ■ exp{Z,(g)Xj, (8.9) 

where Zi{q), . . . , Zr{q) are real polynomials in q with bounded degrees and with their 
coefficients depending on U, x, g, z/, and /. 
For each j = 1 , . . . , r we may write 

Zjil) = Cjeq'^ H h Cjiq + cjo, 

where i = deg Zj and the coefficients reals. Recalling that n = qu + I with 

< / < — 1, we may write Zj{q) as a polynomial in n as follows 

Zj{q) = c'jy + ■■■ + c'j^n + c'jQ, 

where c^-^'s are real coefficients depending on U,g,x, z/, and /. It follows that 

exp{Zj{q)Xj} = exp{c'jfXjn^) ■ ■ ■ exp{c'j^Xjn) exp(c^.o) = bf^ ■ ■ ■ b'^^bjo 
with bji = exp{c'j^Xj) etc. Inserting these into fl8.9p . we see that 

ga{g)---cr-~\g)a-{x) = b',^^-'^...b':^^-'^ 

where bi, . . . ,bk & G and hi, . . . ,hk are integral polynomials in n. Here it is important to 
note that k does not depend on n. Thus (18. 2p becomes 

T"(xr) = ga{g)---a''-\g)a^{x)r 

= 6;^i(")...6^^(")r. (8.10) 

Compared with (18.21) . this has the advantage that the number k of factors on the right- 
hand side is independent of n. This fact will be important for the following lemma to 
hold. 
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Lemma 8.1. Let u be a positive integer and < I < u. Let G/T be a nilmanifold and 
f : G/r — > [—1,1] a Lipschitz function. Let bi,...,bk G G and hi,...,hk be integral 
polynomials in n, where k does not depend on n. Then, for any A > 0, 

J2 /iW/(&?'^"^---&^^"^r) <iviog-^iv (8.11) 

n<N 
n = l{ mod f) 

where the implied constant depends on G, T, T, /, x, v, and A. 

Lemma [8. II can be established in the same way as Theorem 1.1 in Green- Tao |9], where 
the case v = 1 = 1 is handled. Now a proof of Theorem 11.31 is immediate. 

Proof of Theorem \1.3[ Recall that u is the least positive integer satisfying (18. 4p . that is 
u is fixed. Then each n G N can be written as n = uq + I with < I < u — 1, and our 
original sum takes the form 

J2f^{n)f{T-{xr)) = J2 H MfiT'^ixT)) 

n<N 1=0 "<iv 

n — l( mod 

1=0 n<iV 
n — l( mod 

by (I8.10p . Applying Lemma I5TT] to the last sum over n, we get 

J2 Kn)fiT''ixT)) <^ Nhg-^ N 

n<N 

where the implied constant depends on G, F, T, /, x, u, and A. Theorem 11.31 is proved. □ 
9. Appendix: preliminaries on nilmanifolds 

9.1. Nilmanifolds. Let G be a connected, simply connected nilpotent Lie group of di- 
mension r. A filtration G, on G is a sequence of closed connected groups 

G = Gq = Gi D ■ ■ ■ D Gd ^ Gd+i = {idc} 

with the property that [Gj, Gk] C Gj+k for all j, k >0. Here [H, K] denotes the commuta- 
tor group of H and K. The degree d of G, is the least integer such that Gd+i = {idc}- We 
say that G is nilpotent if G has a filtration. If F is a discrete and cocompact subgroup of 
G, then G/T = {gT : g G G} is called a nilmanifold. We write r = dimG and rj = dimGj 
for j = 1, . . . ,d. If a filtration G, of degree d exists then the lower central series filtration 
defined by 

G = Gq = Gi, Gj-^-i = [G, Gj] 

terminates with Gg+i = {idc} for some s < d. The least such s is called the step of the 
nilpotent Lie group G. 
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9.2. Connections with Lie Algebra. Let g be the Lie algebra of G, and let exp : g — t- G 
and log : G — 7- be the exponential and logarithm maps, which are both diffeomorphisms. 
We can also define the 1-parameter subgroup {g^)tm associated to an element g & G, and 
thus 

exp(X)* = exp(tX) 

for all X G and t G M. For an automorphism a of G we denote by da : — ?■ g the 
differential of a. Then we have 

a(exp(X)) = exp{(c/(T)(X)} for any X E Q. (9.1) 

These maps are illustrated below: 

G ^ G 
exp t i log 

9.3. Coordinates of the first and second kind. Now we give the notion of coordinates 
of the first and second kinds. Let X = {Xi, . . . , X^} be a basis for the Lie algebra g. If 

g = exp(uiXi + ■ ■ ■ + UrXr), 

then we say that . . . , Ur) are the coordinates of the first kind or exponential coordinates 
for g relative to the basis X. We write [ui, . . . , Ur) = ipexp{g)- If 

g = exp(t>iXi) ■ ■ ■ exp{vrXr), 

then we say that {vi, . . . ,Vr) are the coordinates of the second kind for g relative to X, 
and we write {vi, . . . ,Vr) = ip{g)- The height of a reduced rational number | is defined to 
be max{|a|, The basis X is said to be Q-rational if all the structure constants Cijk in 
the relations 

[Xj, Xj] = CijkXk 

k 

are rationals of height at most Q. 

The following lemmas describes the connection between the two types of coordinate 
systems; they are Lemmas A. 2 and A. 3 in Green- Tao [S]. 

Lemma 9.1. Let X he a basis for g such that 

[g,X,-] C Span(X,+i,...,X,) (9.2) 

for j = l,...,r — 1. Then the compositions tpe^p o ip'^ and ip o ip'Jp are both polynomial 
maps on W with bounded degree. If X is Q-rational then all the coefficients of these 
polynomials are rational of height at most Q*" for some constant G > 0. 
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Lemma 9.2. Let X be a basis for g satifying h9.2^) . Let x,y E G, and suppose that 
■ip{x) = (ui, . . . , Ur) and ip{y) = {vi, . . . ,Vr)- Then 

1pexp{x) = {Ui,U2 + -Ri(Mi), . . . , Mr + Rr-l{ui, . . .,Ur-l)), 

where each Rj : W ^ W is a polynomial of bounded degree. Also, 

1pe^p{xy) = (Mi + Vi,U2 +V2 + Si{Ui,Vi), ...,Ur + Vr + Sr~i{ui, . . .,Ur-l,Vi, . . .,Vr-l)), 

where each Sj : M.^ x W ^ W is a polynomial of bounded degree. 

Let Q > 2. If X is Q-rational then all the coefficients of the polynomials Rj,Sj are 
rationals of height for some constant C > 0. 
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